A tight upper bound on the size of the antidictionary of a binary string

Authors

  • Hiroyoshi Morita
  • Takahiro Ota
Abstract

An antidictionary is a set of words that never appear in a binary string. Crochemore et al. (2000) presented a compression algorithm for binary text based on antidictionaries, called DCA. Their coding algorithm was tested on the Calgary Corpus, and their experimental results show compression ratios equivalent to those of most common compressors such as pkzip. More recently, Ota and Morita (2004) presented an online source coding scheme based on DCA for lossless compression of ECG (electrocardiogram) data. Experimental results show that their algorithm achieved a compression ratio 10% smaller than that of LZ-based algorithms. In this article, we present an upper bound on the size of the antidictionary of a binary string. The upper bound we obtain is stronger than that of Crochemore et al. (1998), and it is tight in the sense that there exists a string that attains it. We also prove that the size of the antidictionary of a binary string is always smaller than or equal to that of the dictionary of the same string. Moreover, we give an algorithm to reconstruct the dictionary from the antidictionary. This article is organized as follows. Section 2 gives definitions concerning antidictionaries, with some examples. In Sections 3 and 4, we investigate the size of the antidictionary of a given string and derive a tight upper bound on its size. Section 5 presents an algorithm to reconstruct the dictionary from the antidictionary of a given string, and Section 6 summarizes our results.
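To make the two central objects concrete, here is a minimal brute-force sketch in Python. It assumes, following the usual convention in the DCA literature, that the antidictionary consists of the minimal forbidden words of the string (words that do not occur although their longest proper prefix and longest proper suffix both do) and that the dictionary is the set of all substrings. The function names `factors` and `minimal_forbidden_words` are hypothetical, and this is not the algorithm of the article, only an illustration of the definitions.

```python
def factors(s):
    """Return the set of all substrings of s, including the empty word.

    Assumption: this plays the role of the 'dictionary' of s.
    """
    n = len(s)
    return {s[i:j] for i in range(n + 1) for j in range(i, n + 1)}


def minimal_forbidden_words(s, alphabet="01"):
    """Return the minimal forbidden words of s by brute force.

    Assumption: the antidictionary is the set of words w such that
    w does not occur in s, while w[:-1] and w[1:] both occur in s.
    """
    dictionary = factors(s)
    antidictionary = set()
    # Every minimal forbidden word is an occurring word u extended by one symbol.
    for u in dictionary:
        for a in alphabet:
            w = u + a
            if w not in dictionary and w[1:] in dictionary:
                antidictionary.add(w)
    return antidictionary


if __name__ == "__main__":
    s = "01101"
    print(sorted(minimal_forbidden_words(s), key=lambda w: (len(w), w)))
    print(len(factors(s)))
```

For the string `01101`, this sketch yields the four words `00`, `010`, `111`, and `1011`, while the set of substrings (with the empty word) has 12 elements. The enumeration is quadratic in the number of substrings and is meant only for short examples.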

Related resources

Sharp Upper bounds for Multiplicative Version of Degree Distance and Multiplicative Version of Gutman Index of Some Products of Graphs

In 1994, the degree distance of a graph was introduced by Dobrynin, Kochetova and Gutman, and Gutman proposed the Gutman index of a graph in the same year. In this paper, we introduce the concepts of the multiplicative version of degree distance and the multiplicative version of the Gutman index of a graph. We find the sharp upper bound for the multiplicative version of degree distance and multiplicative ver...

An Upper Bound Approach for Analysis of Hydroforming of Sheet Metals

Considering a kinematically admissible velocity field, the upper bound method has been used for predicting the amount of pressure in hydroforming of sheet metals. The effects of work hardening, friction and blank size have been considered in pressure prediction. Also, the effect of sheet thickness variation has been considered in the present work formulations. The relation between pressure and punch s...

Relations Between Greedy and Bit-Optimal LZ77 Encodings

This paper investigates the size in bits of the LZ77 encoding, which is the most popular and efficient variant of the Lempel–Ziv encodings used in data compression. We prove that, for a wide natural class of variable-length encoders for LZ77 phrases, the size of the greedily constructed LZ77 encoding on constant alphabets is within a factor $O\left(\frac{\log n}{\log\log\log n}\right)$ of the optimal LZ77 encoding, w...

The (non-)existence of perfect codes in Lucas cubes

A Fibonacci string of length $n$ is a binary string $b = b_1b_2\ldots b_n$ in which for every $1 \leq i < n$, $b_i\cdot b_{i+1} = 0$. In other words, a Fibonacci string is a binary string without 11 as a substring. Similarly, a Lucas string is a Fibonacci string $b_1b_2\ldots b_n$ such that $b_1\cdot b_n = 0$. For a natural number $n\geq1$, a Fibonacci cube of dimension $n$ is denoted by $\Gamma_n$ and i...

A generalized upper bound solution for bimetallic rod extrusion through arbitrarily curved dies

In this paper, an upper bound approach is used to analyze the extrusion process of bimetallic rods through arbitrarily curved dies. Based on a spherical velocity field, internal, shearing and frictional power terms are calculated. The developed upper bound solution is used for calculating the extrusion force for two types of die shapes: a conical die as a linear die profile and a streamlined di...

Publication date: 2005